Popularity Dynamics and Intrinsic Quality in Reddit and Hacker News
نویسنده
چکیده
In this paper we seek to understand the relationship between the online popularity of an article and its intrinsic quality. Prior experimental work suggests that the relationship between quality and popularity can be very distorted due to factors like social influence bias and inequality in visibility. We conduct a study of popularity on two different social news aggregators, Reddit and Hacker News. We define quality as the number of votes an article would have received if each article was shown, in a bias-free way, to an equal number of users. We propose a simple Poisson regression method to estimate this quality metric from time-series voting data. We validate our methods on data from Reddit and Hacker News, as well the experimental data from prior work. Using these estimates, we find that popularity on Reddit and Hacker News is a relatively strong reflection of intrinsic quality.
منابع مشابه
Popularity and Quality in Social News Aggregators: A Study of Reddit and Hacker News
In this paper we seek to understand the relationship between the online popularity of an article and its intrinsic quality. Prior experimental work suggests that the relationship between quality and popularity can be very distorted due to factors like social influence bias and inequality in visibility. We conduct a study of popularity on two different social news aggregators, Reddit and Hacker ...
متن کاملThe Impact of Crowds on News Engagement: A Reddit Case Study
Today, users are reading the news through social platforms. These platforms are built to facilitate crowd engagement, but not necessarily disseminate useful news to inform the masses. Hence, the news that is highly engaged with may not be the news that best informs. While predicting news popularity has been well studied, it has not been studied in the context of crowd manipulations. In this pap...
متن کاملPredicting Popularity of Posts on Hacker News CS229 Autumn 2016 Project Final Report
In this project, we try to find the best popularity predictor on the dataset of Hacker News posts. Features including n-gram, simple counts, tf-idf, word2vec and topic models and classifiers like SVM, Naive Bayes, etc are experimented. Data resampling is applied to combat the imbalance of the dataset. By combining different features, classifiers and resampling techniques, we are able to find pr...
متن کاملOnline Political Discourse in the Trump Era
We identify general trends in the (in)civility and complexity of political discussions occurring on Reddit between January 2007 and May 2017 – a period spanning both terms of Barack Obama’s presidency and the rst 100 days of Donald Trump’s presidency. We then investigate four factors that are frequently hypothesized as having contributed to the declining quality of American political discourse ...
متن کاملAn Analysis of Post Contributions in the Hacker Community
We are interested in how the posters in the hacker community contribute and exchange information. Text-mining techniques have been used to learn about the quality nature of the posts. We found that the knowledge exchanges in the hacker community are both interesting and complex. We uncover some interesting knowledge exchange behavioral patterns between initial post contributor and post repliers...
متن کامل